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RAW SEQUENCE LISTING DATE: 11/26/2004' 
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Input Set : A:\16336-13-2 seq listing.txt 
Output Set: N:\CRF4\11262004\J780507.raw 

3 <110> APPLICANT: MULL INS , James I. 

4 RODRIGO , Allen G. 

5 LEARN, Gerald H . 

6 LI , Fusheng 

7 NICKLE, David C. 

8 JENSEN, Mark A. 

10 <120> TITLE OF INVENTION: ANCESTRAL AND COT VIRAL SEQUENCES, PROTEINS AND 
COMPOSITIONS 

12 <130> FILE REFERENCE: 16336 -00132 0US 

14 <140> CURRENT APPLICATION NUMBER: 10/780,507 

15 <141> CURRENT FILING DATE: 2004-02-17 

17 <150> PRIOR APPLICATION NUMBER: US 10/204,204 

18 <151> PRIOR FILING DATE: 2001-02-16 

20 <150> PRIOR APPLICATION NUMBER: PCT/USOl/05288 

21 <151> PRIOR FILING DATE: 2001-02-16 

23 <150> PRIOR APPLICATION NUMBER: US 60/183,659 

24 <151> PRIOR FILING DATE: 2000-02-18 

26 <150> PRIOR APPLICATION NUMBER: US 60/447,586 

27 <151> PRIOR FILING DATE: 2003-02-14 
29 <160> NUMBER OF SEQ ID NOS : 125 

31 <170> SOFTWARE: Patentln version 3.1 

33 <210> SEQ ID NO: 1 

34 <211> LENGTH: 2652 

35 <212> TYPE: DNA 

36 <213> ORGANISM: Artificial Sequence 

38 <220> FEATURE : 

39 <223> OTHER INFORMATION: Ancestral HIV-1 group M, subtype B, env sequence 

41 <400> SEQUENCE: 1 

42 atgcgcgtga agggcatccg caagaactac cagcacctgt ggcgctgggg caccatgctg 60 
44 ctggggatgc tgatgatctg ctccgcggcc gagaagctgt gggtgaccgt gtactacggc 12 0 
46 gtgcccgtgt ggaaggaggc caccaccacc ctgttctgcg ccagcgacgc caaggcttac 180 
48 gacaccgagg tccacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240 
50 caggaggtgg tgctggagaa cgtgaccgag aacttcaaca tgtggaagaa caacatggtg 300 
52 gagcagatgc acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360 
54 ttaacccccc tgtgcgtgac cctgaactgc accgacgacc tgcgcaccaa cgccaccaac 420 
56 accaccaaca gcagcgccac caccaacacc accagcagcg gcggcggcac gatggagggc 480 
58 gagaagggcg agatcaagaa ctgcagcttc aacgtgacca ccagcatccg cgacaagatg 54 0 
60 cagaaggagt acgccctgtt ctacaagctg gacgtggtgc ccatcgacaa cgacaacaac 600 
62 aacaccaaca acaacaccag ctaccgcctc atcaactgca acaccagcgt gatcacccag 660 
64 gcctgcccca aggtgagctt cgagcccatc cccatccact actgcacccc cgccggcttc 720 
66 gceatcctga agtgcaacga caagaagttc aacggcaccg gcccctgcac caacgtgagc 780 
68 accgtgcagt gcacccacgg catccgcccc gtggtgagca cccagctgct gctgaacggc 840 
70 agcctggccg aggaggaggt ggtgatccgc agcgagaact tcaccgacaa cgccaagacc 900 
72 atcatcgtgc agctgaacga gagcgtggag atcaactgca cgcgtcccaa caacaacacc 960 
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74 cgcaagagca tccccatcgg ccctggccgc gccctgtacg ccac.cggcaa gatcatcggc 1020 
76 gacatccgcc aggcccactg caacctgtcg cgagccaagt ggaacaacac cctgaagcag 1080 
78 atcgtgacca agctgcgcga gcagttcggc aacaacaaga ccaccatcgt gttcaaccag 1140 
80 agcagcggcg gcgaccccga gatcgtgatg cacagcttca actgcggcgg cgaattcttc 1200 
82 tactgcaaca gcacccagct gttcaacagc acctggcact tcaacggcac ctggggcaac 1260 
84 aacaacaccg agcgcagcaa caacgccgcc gacgacaacg acaccatcac cctgccctgc 132 0 
86 cgcatcaagc agatcatcaa catgtggcag gaggtgggca aggccatgta cgcccccccc 13 80 
88 atcagcggcc agatccgctg cagcagcaac atcaccggcc tgctgctgac tcgagacggc 1440 
90 ggcaacaacg agaacaccaa caacaccgac accgagatct tccgccccgg gggcggcgac 1500 
92 atgcgcgaca actggcgcag cgagctgtac aagtacaagg tggtgaagat cgagcccctg 1560 
94 ggcgtggccc ccaccaaggc caagcgccgc gtggtgcagc gcgagaagcg cgccgtgggc 162 0 
96 atgctgggcg ccatgttcct gggcttcctg ggcgccgccg gcagcaccat gggcgccgcc 1680 
98 agcatgaccc tgaccgtgca ggcccgccag ctgctgagcg gcatcgtgca gcagcagaac 174 0 
100 aacctgctgc gcgccatcga ggcccagcag cacctgctgc agctgaccgt gtggggcatc 1800 
102 aagcagctgc aggcccgcgt gctggccgtg gagcggtacc tgaaggacca gcagctgctg 1860 
104 ggcatctggg gctgcagcgg caagctgatc tgcaccaccg cggtgccctg gaacgccagc 1920 
106 tggagcaaca agagcctgga caagatctgg aacaacatga cctggatgga gtgggagcgc 1980 
108 gagatcgaca actacaccgg cctgatctac accctgatcg aggagagcca gaaccagcag 2040 
110 gagaagaacg agcaggagct gctggagctg gacaagtggg ccagcctgtg gaactggttc 2100 
112 gatatcacca actggctgtg gtacatcaag atcttcatca tgatcgtggg cggcctggtg 2160 
114 ggcctgcgca tcgtgttcgc cgtgctgagc atcgtgaacc gcgtgcgcca gggctacagc 2220 
116 cccctgagct tccagacccg cctgcccgcc ccccgcggcc ccgaccgccc cgagggcatc 2280 
118 gaggaggagg gcggcgagcg cgaccgcgac cgcagcgggc gcctggtgaa cggcttcctg 2340 
120 gccctgatct gggacgacct gcgcagcctg tgcctgttca gctaccaccg cctgcgcgac 2400 
122 ctgctgctga tcgtggcccg catcgtggag ctgctgggcc ggcgcggctg ggaggccctg 2460 
124 aagtattggt ggaacctgct gcagtactgg agccaggagc tgaagaacag cgccgtgagc 2 52 0 
126 ctgctgaacg ccaccgccat cgccgtggcc gagggcaccg accgcgtgat cgaggtggtg 2580 
128 cagcgcgcct gccgcgccat cctgcacatc ccccgccgca tccgccaggg cctggagcgc 2640 
13 0 gccctgctgt ga 2 652 

133 <210> SEQ ID NO: 2 

134 <211> LENGTH: 883 
13 5 <212> TYPE: PRT 

136 <213> ORGANISM: Artificial Sequence 

138 <220> FEATURE: 

139 <223> OTHER INFORMATION: Ancestral HIV-1 group M, subtype B, env sequence 
141 <400> SEQUENCE: 2 

143 Met Arg Val Lys Gly He Arg Lys Asn Tyr Gin His Leu Trp Arg Trp 

144 15 10 15 

147 Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Ala Glu Lys 

148 20 25 ^ 30 

151 Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 

152 35 40 45 

155 Thr Thr Leu Phe Cys Ala Ser Asp. Ala Lys Ala Tyr Asp Thr Glu Val 

156 50 55 60 

159 His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 

160 65 70 75 80 

163 Gin Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys 

164 85 90 95 

167 Asn Asn Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp Asp 



file://C:\CRF4\Outhold\VsrJ780507.htm 



11/26/04 



Page 3 of 8 



RAW SEQUENCE LISTING DATE: 11/26/2004 

PATENT APPLICATION: US/10/780,507 TIME : 10:21:34 

Input Set : A:\16336-13-2 seq listing.txt 
Output Set: N:\CRF4\11262004\J780507.raw 



168 






100 










105 










110 






171 


Gin 


Ser 


Leu Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


172 






115 








120 










125 








175 


Asn 


Cys 


Thr Asp 


Asp 


Leu 


Arg 


Thr 


Asn 


Ala 


Thr 


Asn 


Thr 


Thr 


Asn 


Ser 


176 




130 








135 










140 










179 


Ser 


Ala 


Thr Thr 


Asn 


Thr 


Thr 


Ser 


Ser 


Gly 


Gly 


Gly 


Thr 


Met 


Glu 


Gly 


180 


145 








150 










155 










160 


183 


Glu 


Lys Gly Glu 


He 


Lys 


Asn 


Cys 


Ser 


Phe 


Asn 


Val 


Thr 


Thr 


Ser 


He 


184 








165 










170 










175 




187 


Arg 


Asp 


Lys Met 


Gin 


Lys 


Glu 


Tyr 


Ala 


Leu 


Phe 


Tyr 


Lys 


Leu 


Asp 


Val 


188 






180 










185 










190 






191 


Val 


Pro 


He Asp 


Asn 


Asp 


Asn 


Asn 


Asn 


Thr 


Asn 


Asn 


Asn 


Thr 


Ser 


Tyr 


192 






195 








200 










205 








195 


Arg 


Leu 


He Asn 


Cys 


Asn 


Thr 


Ser 


Val 


lie 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 


196 




210 








215 










220 










199 


Val 


Ser 


Phe Glu 


Pro 


lie 


Pro 


lie 


His 


Tyr 


Cys 


Thr 


Pro 


Ala 


Gly 


Phe 


200 


225 








230 










235 










240 


203 


Ala 


He 


Leu Lys 


Cys 


Asn 


Asp 


Lys 


Lys 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 


204 








245 










250 










255 




207 


Thr 


Asn 


Val Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His 


Gly 


He 


Arg 


Pro 


Val 


Val 


208 






260 










265 










270 






211 


Ser 


Thr 


Gin Leu 


Leu 


Leu 


Asn 


Gly Ser Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


212 






275 








280 










285 








215 


He 


Arg 


Ser Glu 


Asn 


Phe 


Thr 


Asp Asn Ala 


Lys 


Thr 


lie 


lie 


Val 


Gin 


216 




290 








295 










300 










219 


Leu 


Asn 


Glu Ser 


Val 


Glu 


lie 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 


220 


305 








310 










315 










320 


223 


Arg 


Lys 


Ser He 


Pro 


He 


Gly 


Pro 


Gly Arg 


Ala 


Leu 


Tyr 


Ala 


Thr 


Gly 


224 








325 










330- 










335 




227 


Lys 


He 


He Gly 


Asp 


He 


Arg 


Gin 


Ala 


His 


Cys 


Asn 


Leu 


Ser 


Arg 


Ala 


228 






340 










345 










350 






231 


Lys 


Trp Asn Asn 


Thr 


Leu 


Lys 


Gin 


lie 


Val 


Thr 


Lys 


Leu 


Arg 


Glu 


Gin 


232 






355 








360 










365 








235 


Phe 


Gly Asn Asn 


Lys 


Thr 


Thr 


lie 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 


Gly 


236 




370 








375 










380 










239 


Asp 


Pro 


Glu He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


240 


385 








390 










395 










400 


243 


Tyr 


Cys 


Asn Ser 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


His 


Phe 


Asn 


Gly 


244 








405 










410 










415 




247 


Thr 


Trp Gly Asn 


Asn 


Asn 


Thr 


Glu Arg Ser 


Asn 


Asn 


Ala 


Ala 


Asp 


Asp 


248 






420 










425 










430 






251 


Asn 


Asp 


Thr lie 


Thr 


Leu 


Pro 


Cys 


Arg 


lie 


Lys 


Gin 


lie 


He 


Asn 


Met 


252 






435 








440 










445 








255 


Trp 


Gin 


Glu Val 


Gly 


Lys 


Ala 


Met 


Tyr Ala 


Pro 


Pro 


He 


Ser 


Gly 


Gin 


256 




450 








455 










460 










259 


He 


Arg 


Cys Ser 


Ser 


Asn 


He 


Thr Gly Leu 


Leu 


Leu 


Thr 


Arg 


Asp 


Gly 


260 


465 








470 










475 










480 


263 


Gly 


Asn 


Asn Glu 


Asn 


Thr 


Asn 


Asn Thr Asp 


Thr 


Glu 


He 


Phe 


Arg 


Pro 


264 








485 










490 










495 





file://C:\CRF4\Outhold\VsrJ780507.htm 



1 1/26/04 



Page 4 of 8 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/7 80,507 



DATE: 11/26/2004 
TIME: 10:21:34 



Input Set : 
Output Set : 



A:\16336-13-2 seq listing.txt 
N:\CRF4\11262004\J780507 .raw 



26 7 Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
268 500 505 510 

271 Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys 

272 515 520 525 

2 75 Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Met Leu Gly Ala 
276 530 535 540 

2 79 Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala 
280 545 550 555 560 

2 83 Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He Val 
284 565 570 575 

287 Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu 

288 580 585 590 

291 Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu 

292 595 600 605 

2 95 Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly lie Trp Gly 
296 610 615 620 

299 Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala Ser 

300 625 630 635 640 

3 03 Trp Ser Asn Lys Ser Leu Asp Lys He Trp Asn Asn Met Thr Trp Met 
304 645 650 655 

307 Glu Trp Glu Arg Glu lie Asp Asn Tyr Thr Gly Leu He Tyr Thr Leu 

308 660 665 670 

311 He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 

312 675 680 685 

315 Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp He Thr Asn 

316 690 695 700 

319 Trp Leu Trp Tyr He Lys He Phe He Met He Val Gly Gly Leu Val 

320 705 710 715 720 

323 Gly Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 

324 725 730 735 

327 Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Ala Pro Arg 

328 740 745 750 

331 Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly Glu Arg Asp 

332 755 760 765 

335 Arg Asp Arg Ser Gly Arg Leu Val Asn Gly Phe Leu Ala Leu lie Trp 

336 770 775 780 

339 Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp 

340 785 790 795 800 

343 Leu Leu Leu lie Val Ala Arg lie Val Glu Leu Leu Gly Arg Arg Gly 

344 805 810 815 

34 7 Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp Ser Gin 
348 ' 820 825 830 

351 Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala He Ala 

352 835 840 845 

355 Val Ala Glu Gly Thr Asp Arg Val He Glu Val Val Gin Arg Ala Cys 

356 850 855 860 

359 Arg Ala lie Leu His lie Pro Arg Arg He Arg Gin Gly Leu Glu Arg 

360 865 870 875 880 
363 Ala Leu Leu 
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Output Set: N:\CRF4\11262004\J780507.raw 

367 <210> SEQ ID NO: 3 

368 <211> LENGTH: 2562 

369 <212> TYPE: DNA 

370 <213> ORGANISM: Artificial sequence 

372 <220> FEATURE: 

373 <223> OTHER INFORMATION: Ancestral HIV-1 group M, subtype C, env sequence 
3 75 <400> SEQUENCE: 3 

376 atgcgggtga tgggcatcct gcggaactgc cagcagtggt ggatctgggg catcctgggc 60 

378 ttctggatgc tgatgatctg cagcgtgatg ggcaacctgt gggtgaccgt gtactacggc 120 

380 gtgcccgtgt ggaaggaggc caagaccacc ctgttctgcg ccagcgacgc caaggcctac 180 

3 82 gagcgggagg tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240 

384 caggagatgg tgctggagaa cgtgaccgag aacttcaaca tgtggaagaa cgacatggtg 300 

386 gaccagatgc acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360 

388 ctgacccccc tgtgcgtgac cctgaactgc accaacgtga ccaacaccaa caacaacaac 42 0 

3 90 aacaccagca tgggcggcga gatcaagaac tgcagcttca acatcaccac cgagctgcgg 480 

3 92 gacaagaagc agaaggtgta cgccctgttc taccggctgg acatcgtgcc cctgaacgag 540 

3 94 aacagcaaca gcaacagcag cgagtaccgg ctgatcaact gcaacaccag cgccatcacc 600 

396 caggcctgcc ccaaggtgag cttcgacccc atccccatcc actactgcgc ccccgccggc 660 

3 98 tacgccatcc tgaagtgcaa caacaagacc ttcaacggca ccggcccctg caacaacgtg 72 0 
400 agcaccgtgc agtgcaccca cggcatcaag cccgtggtga gcacccagct gctgctgaac 780 
402 ggcagcctgg ccgaggagga gatcatcatc cggagcgaga acctgaccaa caacgccaag 840 
404 accatcatcg tgcacctgaa cgagagcgtg gagatcgtgt gcacccggcc caacaacaac 900 
406 acccggaaga gcatccggat cggccccggc cagaccttct acgccaccgg cgacatcatc 960 

4 08 ggcgacatcc ggcaggccca ctgcaacatc agcgagaagg agtggaacaa gaccctgcag 102 0 
410 cgggtgggca agaagctgaa ggagcacttc cccaacaaga ccatcaagtt cgagcccagc 108 0 
412 agcggcggcg acctggagat caccacccac agcttcaact gccggggcga gttcttctac 1140 
414 tgcaacacca gcaagctgtt caacagcacc tacaacagca ccaacaacgg caccaccagc 1200 
416 aacagcacca tcaccctgcc ctgccggatc aagcagatca tcaacatgtg gcagggcgtg 1260 
418 ggccgggcca tgtacgcccc ccccatcgcc ggcaacatca cctgcaagag caacatcacc 1320 
420 ggcctgctgc tgacccggga cggcggcaac accaacaaca ccaccgagac cttccggccc 1380 
422 ggcggcggcg acatgcggga caactggcgg agcgagctgt acaagtacaa ggtggtggag 1440 
424 atcaagcccc tgggcgtggc ccccaccgag gccaagcggc gggtggtgga gcgggagaag 1500 
426 cgggccgtgg gcatcggcgc cgtgttcctg ggcttcctgg gcgccgccgg cagcaccatg 1560 

42 8 ggcgccgcca gcatcaccct gaccgtgcag gcccggcagc tgctgagcgg catcgtgcag 162 0 
430 cagcagagca acctgctgcg ggccatcgag gcccagcagc acatgctgca gctgaccgtg 1680 
432 tggggcatca agcagctgca gacccgggtg ctggccatcg agcggtacct gaaggaccag 1740 
434 cagctgctgg gcatctgggg ctgcagcggc aagctgatct gcaccaccgc cgtgccctgg 1800 

43 6 aacagcagct ggagcaacaa gagccaggac gacatctggg acaacatgac ctggatgcag 1860 
43 8 tgggaccggg agatcagcaa ctacaccgac accatctacc ggctgctgga ggacagccag 1920 
440 aaccagcagg agaagaacga gaaggacctg ctggccctgg acagctggaa gaacctgtgg 1980 
442 aactggttcg acatcaccaa ctggctgtgg tacatcaaga tcttcatcat gatcgtgggc 2040 
444 ggcctgatcg gcctgcggat catcttcgcc gtgctgagca tcgtgaaccg ggtgcggcag 210 0 
446 ggctacagcc ccctgagctt ccagaccctg acccccaacc cccggggccc cgaccggctg 2160 
448 ggcggcatcg aggaggaggg cggcgagcag gaccgggacc ggagcatccg gctggtgagc 2220 
450 ggcttcctgg ccctggcctg ggacgacctg cggagcctgt gcctgttcag ctaccaccgg 22 80 
4 52 ctgcgggact tcatcctgat cgccgcccgg ggcgtgaacc tgctgggccg gagcagcctg 234 0 
454 cggggcctgc agcggggctg ggaggccctg aagtacctgg gcagcctggt gcagtactgg 24 0 0 
456 ggcctggagc tgaagaagag cgccatcagc ctgctggaca ccatcgccat cgccgtggcc 2460 . 
458 gagggcaccg accggatcat cgagctggtg cagcggatct gccgggccat ccggaacatc 2520 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 11/26/2004 

PATENT APPLICATION: US/10/780,507 TIME: 10:21:35 




put Set : A:\16336-13-2 seq listing.txt 
tutput Set: N:\CRF4\11262004\J780507.raw 



Invalid Line Length. 

The rules require that a line not exceed 72 characters in length. This includes spaces. 



Seq#:l; Line(s) 10 
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VERIFICATION SUMMARY DATE : 11/26/2004 

PATENT APPLICATION: US/10/780,507 TIME: 10:21:35 

Input Set : A:\16336-13-2 seq listing.txt 
Output Set: N:\CRF4\11262004\J780507.raw 
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